Constraint-Based Search of Straddling Biclusters and Discriminative Patterns
نویسندگان
چکیده
The state-of-the-art Data-Peeler algorithm extracts closed patterns in n-ary relations. Because it refines a lower bound and an upper bound of the pattern space, Data-Peeler can, in some circumstances, guarantee that a region of the pattern space does not contain any closed n-set satisfying some relevance constraint, allowing the algorithm to not perform any further pattern search in that region. If it is so, this region is left unexplored and some time is saved. Not all constraints enable such a pruning of the pattern space but both the monotone and the anti-monotone constraints do. This article shows that a minimal (resp. maximal) cover of some arbitrary groups of elements is anti-monotone (resp. monotone). As a consequence, Data-Peeler may prune the search space with those constraints and efficiently discover many different patterns. For instance, it can list the so-called straddling biclusters, which cover at least some given portions of every group. It can also discover closed n-sets that discriminate a group from the others, what has potential applications to supervised classification.
منابع مشابه
Constraint-Based Search of Different Kinds of Discriminative Patterns
The state-of-the-art DATA-PEELER algorithm extracts closed patterns in n-ary relations. Because it refines both a lower and an upper bound of the pattern space, DATA-PEELER can, in some circumstances, guarantee that a region of that space does not contain any closed n-set satisfying some relevance constraint. Whenever it happens, such a region is unexplored and computation saved. This paper sho...
متن کاملDescoberta de n-conjuntos Fechados Eficiente e Restrita a Grupos de Interesse
The state-of-the-art Data-Peeler algorithm extracts closed patterns in n-ary relations. Because it refines a lower bound and an upper bound of the pattern space, Data-Peeler can, in some circumstances, guarantee that a region of the pattern space does not contain any closed n-set satisfying some relevance constraint. If it is so, this region is left unexplored and some time is saved. Not all co...
متن کاملApplication of Greedy Randomized Adaptive Search Procedure to the Biclustering of Gene Expression Data
Microarray technology demands the development of data mining algorithms for extracting useful and novel patterns. A bicluster of a gene expression dataset is a local pattern such that the genes in the bicluster exhibit similar expression patterns through a subset of conditions. In this study biclusters are detected in two steps. In the first step high quality bicluster seeds are generated using...
متن کاملApplication of Cardinality based GRASP to the Biclustering of Gene Expression Data
Biclustering algorithms perform simultaneous row and column clustering of a given data matrix. In gene expression dataset a bicluster is a subset of genes that exhibit similar expression patterns through a subset of conditions. Biclustering is a useful data mining technique for identifying local patterns from gene expression data. In this paper biclusters are identified in two steps. In the fir...
متن کاملA General Framework for Biclustering Gene Expression Data
A large number of biclustering methods have been proposed to detect patterns in gene expression data. All these methods try to find some type of biclusters but no one can discover all the types of patterns in the data. Furthermore, researchers have to design new algorithms in order to find new types of biclusters/patterns that interest biologists. In this paper, we propose a novel approach for ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JIDM
دوره 4 شماره
صفحات -
تاریخ انتشار 2013